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Transcription factor 

TECHNICAL FIELD 

5 The present invention relates generally to methods and materials 
based on transcription factors involved in sugar signalling in 
plants . 

BACKGROUND ART 

10 

Development of the cereal seed is orchestrated by the coordinated 
activities of a large number of genes that encode metabolic and 
regulatory enzymes, as well as other proteins (Olsen, 2001) . This 
results in a triploid endosperm, the embryo, pericarp, seed coat 
15 and other tissues of the mature grain. The endosperm structure 

consists of two tissues, the interior starch-filled endosperm and 
the outer epidermal layer called the aleurone. 

Starch, which is a mixture of amylopectin (a heavily branched 
20 polyglucan) and amylose (a mostly linear polyglucan) , is deposited 
in the endosperm as granules. The synthesis and deposition of 
starch in the endosperm depend on enzymes such as ADP-glucose 
pyrophosphorylase (AGPase) , starch synthases (SS) , starch branching 
enzymes (SBE) , and starch debranching enzymes (DBE) . Most, if not 
25 all, of these enzymes exist in two or more isoforms (see Ball et 

al., 1998/ Buleon et al . , 1998; Myers et al . , 2000; Nakamura, 2002; 
Smith, 2001; for recent reviews on starch biosynthesis) . DBEs are 
grouped into two distinct classes, isoamylase and pullulanase each 
with different isoforms (Nakamura, 1996). Isoamylase (EC 3.2.1.68) 
30 is an essential enzyme in amylopectin synthesis. However, the 

precise role of the enzyme in this process is still not clear, and 
different models have been proposed (Ball et al., 1998; Nakamura, 
2002; Smith, 2001) . 

35 It has been reported that expression of starch synthesis genes, 

i.e. SS in potato (Visser et'al., 1991), ADPGase in potato (Mul'ler- 
Rdber et al . , 1990), sweet potato (Bae and Liu, 1997), AralDidopsis 
(Rook et al., 2001) and tomato (Li et al, 2002), SBE in potato 
(Kossman et al., 1991), maize (Kim and Guiltinan, 1999) and 



Arabidopsis (Khoshnoodi et al., 1998), and isoamylase in barley 
(Sun et al., 1999), are sugar inducible. In contrast to the 
situation in bacteria, yeast and mammals, where sugar signaling 
cascades are extensively studied, the sugar signaling transduction 
pathways in plants are poorly understood (Rolland et al . , 2002). 
Generally, in higher plants, high sugar levels stimulate expression 
of genes involved in sink function, such as growth, storage of 
proteins and biosynthesis of starch and other carbohydrates, 
whereas low sugar levels promote photosynthesis and mobilization of 
energy reserves, such as breakdown of storage starch or lipids. 

Sugar signaling can be dissected into three steps, sugar sensing, 
signal transduction, and target gene expression. The picture is 
clouded by the dual function of sugars as nutrients and signaling 
molecules, and by the interaction (in plants and animals) between 
sugar signaling and hormonal networks. In plants the complexity is 
further increased by the vital role of sugar production through 
photosynthesis. Hexoses, sucrose and trehalose might serve as 
elicitors of plant sugar signaling (Goddijn and Smeekens, 1998; 
Rolland et al . , 2002). Hexo kinase, sucrose and glucose 
transporters, and various sugar receptors, have been proposed as 
components of the sugar sensing machinery (Rolland et al . , 2002; 
Sheen et al . , 1999; Smeekens, 2000). 

The Ser/Thr protein kinase Snfl is a central participant in yeast 
sugar signaling (Carlson, 1999) . Snfl phosphorylates downstream 
components and is also itself activated by phosphorylation. Snf 
related protein kinases (SnRKs) are found in yeast, mammals and 
plants, where they partake in a large number of regulatory 
functions (Halford and Hardie, 1998; Hardie et al., 1998). There is 
evidence that some plant SnRKs share functional homology with Snfl 
in plant sugar signaling, although the exact nature of their 
responses to sugars remains to be clarified (Rolland et al . , 2002). 
Other players implicated in sugar signaling transduction .pathways 
in plants are sugar metabolites, 14-3-3 proteins, trehalose- 6- 
phosphate, and Ca 2+ (Rolland et al . , 2002). 

Little is known about the cis and trans factors mediating the final 
steps in plant sugar signaling. To date, five different types of- 



cis elements have been identified in sugar-regulated plant 
promoters; the SURE (Grierson et al., 1994), SP8 (Ishiguro and 
Nakamura, 1994), TGGACGG (Maeo et al . , 2001), G box (Giuliano et 
al., 1988) and B box (Grierson et al., 1994; Zourelidou et al . , 
2002) elements. 

Only two putative transcription factors, SPFl and STK, with 
relevance to plant sugar signaling have been isolated. SPFl was 
isolated from sweet potato, and it was reported that SPFl binds to 
the SP8a and SP8b promoter elements of the ft-amylase and sporamin 
genes of tuberous roots, where it functions as a repressor 
(Ishiguro and Nakamura, 1994) . STK binds to the B box as an 
activator (Zourelidou et al . , 2002). 

It can thus be seen that novel transcription factors, for instance 
those which could be used to affect starch synthesis or deposition 
(and hence structure) may provide a useful contribution to the art. 

DISCLOSURE OF THE INVENTION 

The present inventors have isolated a transcription factor, SUSIBA2 
(sugar signaling in barley) , and examined its interaction with 
endosperm specific expression of the sbellb (encoding SBEIIb) and 
isol (encoding isoamylase 1) genes during barley seed development. 

Antibodies against SUSIBA2 were produced and the expression pattern 
for susiba2 was determined at the RNA and protein levels. It was 
found that susiba2 is expressed in endosperm but not leaves. 
Transcription of susiba2 is sugar inducible and ectopic susiba2 
expression was obtained in sugar-treated leaves. The temporal 
expression of susiba2 in barley endosperm followed that of isol and 
endogenous sucrose levels, with a peak at around 12 days after 
pollination. 

It was previously known that barley isol harbors an SP8a element 
and that it contributes to the endosperm specificity of isol 
expression by recruiting a repressor in non-expressing tissues (Sun 
et al., 1999). Similarly, the endosperm-specific expression of 
barley sJbeJJjb (Sun et al . , 1998) is partly determined by a 



repressor-binding B box element, Bbl, in the second intron of the 
gene (Ahlandsberg et al., 2002b). 

Interestingly SUSIBA2 appears to bind, as an activator, to the SURE 
5 (sugar responsive) element in the Isol promoter. It also binds the 
W box elements but not to the SP8a element. A novel application of 
a transcription factor oligodeoxynucleotide decoy strategy with 
transformed barley endosperm was used to provide experimental 
evidence for the importance of the SURE element in isol 
10 transcription. 

Thus, not only is its biding specificity different to SPF1 and STK 
described above, but SUSIBA2 also represents the first i'solated 
SURE-binding transcription factor, the first WRKY protein known to 
15 be involved in carbohydrate anabolism, and the first transcription 
factor of any sort reported for regulation of starch synthesis. 
It shares only a very low degree of sequence identity (28%) with 
SPF1. Orthologs to SUSIBA2 were also isolated from rice and wheat 
endosperm. 

20 

The sequences from the rice and wheat cDNAs, and wheat peptide are 
annexed hereto . 

Thus in a first aspect of the invention there is disclosed an 
25 isolated nucleic acid which comprises, or consists essentially of, 
a nucleotide sequence which encodes a transcription factor which is 
capable of modulating the activity of a promoter of a gene encoding 
an enzyme involved in the synthesis or deposition of starch. 

30 "Nucleic acid" and "nucleic acid molecule" have the same meaning. 
As stated above, the nucleic acid may consist essentially of a 
nucleotide sequence of the present invention (which is to say that 
the sequence is ^of the essence of' the molecule, generally making 
up more than 50% of 'it) . 

35 

The promoter will preferably include at least one SURE element (as 
described in more detail hereinafter - see Example 7 in particular) 
and\or W box element and the transcription factor will bind to one 
or more of these. 
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The enzyme (and hence promoter) may optionally be selected from the 
following genes: Isol, sbel, sbellb, ssl f agpaseS. 

5 Preferably the transcription factor is a WRKY protein which is 

involved in carbohydrate anabolism. More specifically it modulates 
the activity of the promoter within a plant in response to sugar 
levels in the plant i.e. the factor is a component of a sugar 
signaling pathway which is responsible for modulating target gene 
10 expression. 

Preferably the nucleotide sequence is a SUSIBA2 nucleotide sequence 
which encodes the 573-amino acid sequence given in Figure 1. 

15 Preferably the nucleotide sequence is a SUSIBA2 nucleotide sequence 
which consists of the coding sequence given in Figure 1 or one 
which is degeneratively equivalent or complementary thereto. The 
length of the sus±ba2 cDNA is 2355 nucleotides, and the open 
reading frame starts at position 247 and ends at position 1966 

20 (Figure 1) . 

In alternative embodiments the nucleotide sequence is a SUSIBA2 
nucleotide sequence which encodes an amino acid sequence given in 
the Sequence Annex. In preferred alternative embodiments the 

25 nucleotide sequence is a SUSIBA2 nucleotide sequence which 

comprises a coding sequence given in the Sequence Annex or one 
which is degeneratively equivalent or complementary thereto. Where 
aspects or embodiments of the invention are discussed in relation 
to the barley sequence given in Figure 1, it will be understood 

30 that each of those aspects or embodiments applies mutatis mutandis 
to the wheat or rice orthologues, as described in the Sequence 
Annex . 

The nucleic acid molecules or vectors (see below) according to the 
35 present invention may be provided isolated and/or purified from 
their natural environment, in substantially pure or homogeneous 
form, or free or substantially free of nucleic acid or genes of the 
species of interest or origin other than the sequence encoding a 
polypeptide with the required function. The term "isolated" 



encompasses all these possibilities. 



Nucleic acid according to the present invention may include cDNA, 
RNA, genomic DNA and modified nucleic acids or nucleic acid 
analogs. Where a DNA sequence is specified, e.g. with reference to 
a figure, unless context requires otherwise the RNA equivalent, 
with U substituted for T where it occurs, is encompassed. 

Complement sequences of those discussed herein are also 
encompassed. As is well understood by those skilled in the art, two 
nucleic acid nucleotide sequences are "complementary" when one will 
properly base pair with all or part of the other according to the 
standard rules (G pairs with C, and A pairs with T) . One sequence 
is "the complement" of another where those sequences are of the 
same length, but are complementary to each other. 

In a further aspect of the present invention there is disclosed a 
nucleic acid molecule comprising (or preferably consisting 
essentially of) a nucleotide sequence which is a variant of the 
barley SUSIBA2 nucleotide sequence of Figure 1. 

Thus the invention provides, inter alia, an isolated transcription 
factor gene derivable from (and which can be expressed 
recombinantly in) maturing seeds, and which encodes a transcription 
factor protein which targets a promoter of a gene encoding an 
enzyme involved in the synthesis or deposition (anabolism) of 
starch. Such genes may encode SUSIBA2, or are related or derived 
therefrom e.g. genes which encode an RNA which hybridizes to the 
SUSIBA2 gene under high stringency conditions. The transcription 
factors may bind a SURE and\or W box element in the promoter, and 
optionally not bind the SP8a element. 

The gene may be present in an expression cassette, and the 
transcription factor of the invention may be used, for example, in 
a method for enhancing or reducing expression of a starch anabolic 
enzyme, which method may involve transforming a seed crop plant 
with the factor gene, said plant thereby expressing the 
transcription factor protein encoded by said transcription factor 
gene. This may be used to give enhanced, reduced, or altered 
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starch synthesis. More specifically the disclosure of these 
sequences provides a novel mechanism for manipulating the starch 
anabolism activity in plants in a number of important respects. 
These include, Inter alia, the ability to: modulate the activity of 
5 isoamylases and other enzymes in plants; alter responsiveness of 
this pathway to sugar levels; alter overall debranching enzyme 
activity; alter such enzyme activities in various different tissues 
or subcellular compartments or at various different developmental 
stages and produce novel starch types in transgenic line. 

10 

Some aspects and embodiments of the invention will now be discussed 
in more detail. 

SUSIBA2 Variants 

15 

Variants of the present invention can be artificial nucleic acids 
(i.e. containing sequences which have not originated naturally) 

which can be prepared by the skilled person in the light of the 

present disclosure. Artificial variants (derivatives) may be 
20 prepared by those skilled in the art, for instance by site directed 

pr random mutagenesis, or by direct synthesis. Preferably the 

variant nucleic acid is generated either directly or indirectly 
(e.g. via one or amplification or replication steps) from an 

original nucleic acid having all or part of the sequences of the 
25 first aspect. Preferably the variant encodes a product which has 

one or more of the transcription factor activities discussed above. 

Alternatively they may be novel, naturally occurring, nucleic 
acids, isolatable using the sequences of the present invention. 
30 Sequence variants which occur naturally may also include alleles 
(which will include polymorphisms or mutations at one or more 
bases) . Preferred nucleic acids are those encoding orthologs from 
any of rice, wheat, or potato or derivatives of these as described 
in more detail below. 

35 

Thus a variant may be or include a distinctive part or fragment 
(however produced) corresponding to a portion of the sequence 
provided. These portions may include motifs which are distinctive 
to SUSIBA2 sequences, such motifs being discussed in more detail in 
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the Examples below. 

Fragments may encode or omit particular functional parts of the 
polypeptide. Equally the fragments may have utility in probing for, 
or amplifying, the sequence provided or closely related ones. 
Suitable lengths of fragment, and conditions, for such processes 
are discussed in more detail below. Also included are nucleic acids 
which have been extended at the 3 1 or 5' terminus with respect to 
those of the first aspect. 

The term "variant" nucleic acid as used herein encompasses all of . 
these possibilities. When used in the context of polypeptides or 
proteins it indicates the encoded expression product of the variant 
nucleic acid. 

Some of the aspects of the present invention relating to variants 
will now be discussed in more detail. 

Homology (either similarity or identity) may be as defined and 
determined by the TBLASTN program, of Altschul et al- (1990) J. 
Mol. Biol. 215: 403-10, which is in standard use in the art, or, 
and this may be preferred, the standard program BestFit, which is 
part of the Wisconsin Package, Version 8, September 1994, (Genetics 
Computer Group, 575 Science Drive, Madison, Wisconsin, USA, 
Wisconsin 53711) . BestFit makes an optimal alignment of the best 
segment of similarity between two sequences. Optimal alignments 
are found by inserting gaps to maximize the number of matches using 
the local homology algorithm of Smith and Waterman. 

Homology, with respect to SUSIBA2 in Figure 1 may be at the 
nucleotide sequence and/or encoded amino acid sequence level. 
Preferably, the nucleic acid and/or amino acid sequence shares at 
least about 50%, or 60%, or -70%, or 80% homology, most preferably 
at least about 90%, 95%, 96%, 97%, 98% or 99% similarity or 
identity. 

Thus a variant polypeptide in accordance with the present invention 
may include within an amino acid sequences described herein a 
single amino acid or 2, 3, 4, 5, 6, 7, 8, or 9 changes, about 10, 



15, 20, 30, 40 or 50 changes, or greater than about 50, 60, 70, 80, 
90, 100 or 115 changes. In addition to one or more changes within 
the amino acid sequence shown, a variant polypeptide may include 
additional amino acids at the C-terminus and/or N-terminus - 

Thus in a further aspect of the invention there is disclosed a 
method of producing a derivative nucleic acid comprising the step 
of modifying the coding sequence of a nucleic acid comprising any 
one the sequences discussed above. 

Changes to a sequence, to produce a derivative, may be by one or 
more of addition, insertion, deletion or substitution of one or 
more nucleotides in the nucleic acid, which may lead to the 
addition, insertion, deletion or substitution of one or more amino 
acids in the encoded polypeptide. Changes may be desirable for a 
number of reasons, including introducing or removing the following 
features: restriction endonuclease sequences; codon usage; other 
sites which are required for post translation modification; 
cleavage sites in the encoded polypeptide; motifs in the encoded 
polypeptide (e.g. binding sites) . Leader or other targeting 
sequences (e.g. the putative TM region) may be added or removed 
from the expressed protein to determine its location following 
expression. All of these may assist in efficiently cloning and 
expressing an active polypeptide in recombinant form (as described 
below) . 

Other desirable mutation may be random or site directed mutagenesis 
in order to alter the activity (e.g. specificity) or stability of 
the encoded polypeptide. Changes may be by way of conservative 
variation, i.e. substitution of one hydrophobic residue such as 
isoleucine, valine, leucine or methionine for another, or the 
substitution of one polar residue for another, such as arginine for 
lysine, glutamic for aspartic acid, or glutamine for asparagine. As 
is well known to those skilled in the art, altering the primary 
structure of a polypeptide by a conservative substitution may not 
significantly alter the activity of that peptide because the 
side-chain of the amino acid which is inserted into the sequence 
may be able to form similar bonds and contacts as the side chain of 
the amino acid which has been substituted out. This is so even when 
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the substitution is in a region which is critical in determining 
the peptides conformation. Also included are variants having 
non-conservative substitutions. As is well known to those skilled 
in the art, substitutions to regions of a peptide which are not 
critical in determining its conformation may not greatly affect its 
activity because they do not greatly alter the peptide's three 
dimensional structure. 

In regions which are critical in determining the peptides 
conformation or activity such changes may confer advantageous 
properties on the polypeptide. Indeed, changes such as those 
described above may confer slightly advantageous properties on the 
peptide e.g. altered stability or specificity. 

Other methods for generating novel specificities may include mixing 
or incorporating sequences from related SUSIBA genes into the 
sequences disclosed herein. For example restriction enzyme 
fragments of related genes could be ligated together. An 
alternative strategy for modifying SUSIBA2 sequences would employ 
PGR as described below (Ho et al., 1989, Gene 77 , 51-59) or DNA 
shuffling (Crameri et al . , 1998, Nature 391). 

In a further aspect of the present invention there is provided a 
method of detecting, identifying and/or cloning (isolating) a 
nucleic acid of the present invention (e.g. a homologue of the 
sequences set out hereinafter) from a plant which method employs 
any of the sequences of the invention discussed above. In 
particular the methods will generally employ primers or probes 
derived from all or part of these sequences (or sequences 
complementary thereto) set out herein. 

An oligonucleotide primer for use in amplification reactions may be 
about 30 or fewer nucleotides in length. Generally specific 
primers are upwards of 12, 13, 14, 15, 18, 21 or 24 nucleotides in 
length. For optimum specificity and cost effectiveness, primers of 
16-24 nucleotides in length may be preferred. 

An oligonucleotide or polynucleotide probe may be based on the any 
of the sequences disclosed herein (e.g. introns or exons, although 



the latter may be preferred) . If required, probing can be done with 
entire restriction fragments of the genes which may be 100' s or 
even 1000' s of nucleotides in length. 

Those skilled in the art are well versed in the design of primers 
for use processes such as PCR. The primers will usually be based 
on sequences which are peculiar or unique to the SUSIBA2 sequences 
(see e.g. discussion of C-terminal sequence in the Examples below). 

When using such probes or primers, if need be, clones or fragments 
identified in the search can be extended. For instance if it is 
suspected that they are incomplete, the original DNA source (e.g. a 
clone library, mRNA preparation etc.) can be revisited to isolate 
missing portions e.g. using sequences, probes or primers based on 
that portion which has already been obtained to identify other 
clones containing overlapping sequence. 

In one embodiment, a variant in accordance with the present 
invention is also obtainable by means of a method which includes: 
(a) providing a preparation of nucleic acid, e.g. from plant cells, 
. (b) providing a probe or primer as discussed above, 
(c) contacting nucleic acid in said preparation with said nucleic 
acid molecule under conditions for hybridisation of said nucleic 
acid molecule to any said gene or homologue in said preparation, 
and identifying said gene or homologue if present by its 
hybridisation with said nucleic acid molecule. 

Plants which may be a suitable source of SUSIBA2 may include any 
cereal, and in particular the developing seeds of such cereals. 

Probing may employ the standard Southern blotting technique. For 
instance DNA may be extracted from cells and digested with 
different restriction enzymes. Restriction fragments may then be 
separated by electrophoresis on an agarose gel, before denaturation 
and transfer to a nitrocellulose filter. Labelled probe may be 
hybridised to the DNA fragments on the filter and binding 
determined. DNA for probing may be prepared from RNA preparations 
from cells. 



12 

Test nucleic acid may be provided from a cell as genomic DNA, cDNA 
or RNA, or a mixture of any of these, preferably as a library in a' 
suitable vector. If genomic DNA is used the probe may be used to 
identify untranscribed regions of the gene (e.g. promoters etc.), 
such as is described hereinafter. Probing may optionally be done by 
means of so-called "nucleic acid chips" (see Marshall & Hodgson 
(1998) Nature Biotechnology 16: 27-31, for a review) . 

Preliminary experiments may be performed by hybridising under low 
stringency conditions. For probing, preferred conditions are those 
which are stringent enough for there to be a simple pattern with a 
small number of hybridisations identified as positive which can be 
investigated further. 

For instance, screening may initially be carried out under 
conditions, which comprise a temperature of about 37 °C or less, a 
formamide concentration of less than about 50%, and a moderate to 
low salt (e.g. Standard Saline Citrate ("SSC") - 0.15 M sodium 
chloride; 0.15 M sodium citrate; pH 7) concentration. 

Alternatively, a temperature of about 50 °C or less and a high salt 
(e.g. "SSPE" 0.180 mM sodium chloride; 9 mM disodium hydrogen 
phosphate; 9 mM sodium dihydrogen phosphate; 1 mM sodium EDTA; pH 
7^4). Preferably the screening is carried out at about 37°C, a 
formamide concentration of about 20%, and a salt concentration of 
about 5 X SSC, or a temperature of about 50 °C and a salt 
concentration of about 2 X SSPE. These conditions will allow the 
identification of sequences which have a substantial degree of 
homology (similarity, identity) with the probe sequence, without 
requiring the perfect homology for the identification of a stable 
hybrid. 

Suitable conditions include, e.g. for detection of sequences that 
are about 80-90% identical, hybridization overnight at 42°C in 
0.25M Na 2 HP0 4 , pH 7.2, 6.5% SDS, 10% dextran sulfate and a final 
wash at 55 °C in 0 . IX SSC, 0.1% SDS. For detection of sequences 
that are greater than about 90% identical, suitable conditions 
include hybridization overnight at 65 °C in 0.25M Na 2 HP0 4 , pH 7.2, 
6.5% SDS, 10% dextran sulfate and a final wash at 60 °C in 0 . IX SSC, 
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0.1% SDS. 

It is well known in the art to increase stringency of hybridisation 
gradually until only a few positive clones remain. Suitable 
conditions would be achieved when a large number of hybridising 
fragments were obtained while the background hybridisation was low. 
Using these conditions nucleic acid libraries, e.g. cDNA libraries 
representative of expressed sequences, may be searched. Those 
skilled in the art are well able to employ suitable conditions of 
the desired stringency for selective hybridisation, taking into 
account factors such as oligonucleotide length and base 
composition, temperature and so on. 

Binding of a probe to target nucleic acid (e.g. DNA) may be 
measured using any of a variety of techniques at the disposal of 
those skilled in the art. For instance, probes may be 
radioactively, f luorescently or enzymatically labelled. Other 
methods not employing labelling of probe include amplification 
using PCR (see below) or RNAase cleavage. The identification of 
successful hybridisation is followed by isolation of the nucleic 
acid which has hybridised, which may involve one or more steps of 
PCR or amplification of a vector in a suitable host. 

In a further embodiment, hybridisation of nucleic acid molecule to 
a variant may be determined or identified indirectly, e.g. using a 
nucleic acid amplification reaction, particularly the polymerase 
chain reaction (PCR) . PCR requires the use of two primers to 
specifically amplify target nucleic acid, so preferably two nucleic 
acid molecules with sequences characteristic of are employed. Using 
RACE PCR, only one such primer may be needed (see "PCR protocols; A 
Guide to Methods and Applications", Eds. Innis et al, Academic 
Press, New York, (1990)). 

Thus a method involving use of PCR in obtaining nucleic acid 
according to the present invention may be carried out as described 
above, but using a pair of nucleic acid molecule primers useful in 
(i.e. suitable for) PCR. 

The methods described above may also be used to determine the 
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presence of one of the nucleotide sequences of the present 
invention within the genetic context of an individual plant, 
optionally a transgenic plant which may be produced as described in 
more detail below. This may be useful in plant breeding programmes 
e.g. to directly select plants containing alleles which are 
responsible for desirable traits in that plant species, either in 
parent plants or in progeny (e.g. hybrids, Fl, F2 etc.). Thus use 
of the markers defined in the Examples below, or markers which can 
be designed by those skilled in the art on the basis the nucleotide 
sequence information disclosed herein, forms one part of the 
present invention . 

As used hereinafter, unless the context demands otherwise, the term 
"SUSIBA2 nucleic acid" is intended to cover any of the nucleic 
acids of the invention described above, including functional 
variants . 

SUSIBA2 Vectors and Transformation 

In one aspect of the present invention, the SUSIBA2 nucleic acid 
described above is in the form of a recombinant and preferably 
replicable vector. 

"Vector" is defined to include, inter alia, any plasmid, cosmid, 
phage or AgroJbacterium binary vector in double or single stranded 
linear or circular form which may or may not be self transmissible 
or mobilizable, and which can transform prokaryotic or eukaryotic 
host either by integration into the cellular genome or exist 
extrachromosomally (e.g. autonomous replicating plasmid with an 
origin of replication) . 

Specifically included are shuttle vectors by which is meant a DNA 
vehicle capable, naturally or by design, of replication in two 
different host organisms, which may be selected from actinomycetes 
and related species, bacteria and eucaryotic (e.g. higher plant, 
yeast or fungal cells) . 

A vector including nucleic acid according to the present invention 
need not include a promoter or other regulatory sequence, 
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particularly if the vector is to be used to introduce the nucleic 
acid into cells for recombination into the genome. 

Preferably the nucleic acid in the- vector is under the control of, 
and operably linked to, an appropriate promoter or other regulatory 
elements for transcription in a host cell such as a microbial, e.g. 
bacterial, or plant cell. The vector may be a bi-functional 
expression vector which functions in multiple hosts. In the case of 
genomic DNA, this may contain its own promoter or other regulatory 
elements and in the case of cDNA this may be under the control of 
an appropriate promoter or other regulatory elements for expression 
in the host cell 

By "promoter" is meant a sequence of nucleotides from which 
transcription may be initiated of DNA operably linked downstream 
(i.e. in the 3 1 direction on the sense strand of double-stranded 
DNA) . 

"Operably linked" means joined as part of the same nucleic acid 
molecule, suitably 'positioned and oriented for transcription to be 
initiated from the promoter. DNA operably linked to a promoter is 
"under transcriptional initiation regulation' 1 of the promoter. 

Thus this aspect of the invention provides a gene construct, 
preferably a replicable vector, comprising a promoter operatively 
linked to a nucleotide sequence provided by the present invention. 

Generally speaking, those skilled in the art are well able to 
construct vectors and design protocols for recombinant gene 
expression. Suitable vectors can be chosen or constructed, 
containing appropriate regulatory sequences, including promoter 
sequences, terminator fragments, polyadenylation sequences, 
enhancer sequences, marker genes and other sequences as 
appropriate. For further details see, for example, Molecular 
Cloning: a Laboratory Manual: 2nd edition, Sambrook et al, 1989, 
Cold Spring Harbor Laboratory Press. 

Many known techniques and protocols for manipulation of nucleic 
acid, for example in preparation of nucleic acid constructs, 
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mutagenesis (see above discussion in respect of variants) , 
sequencing, introduction of DNA into cells and gene expression, and 
analysis of proteins, are described in detail in Current Protocols 
in Molecular Biology, Second Edition, Ausubel et al . eds., John 
Wiley & Sons, 1992. The disclosures of Sambrook et al- and Ausubel 
et al. are incorporated herein by reference. 

In one embodiment of this aspect of the present invention, there is 
provided a gene construct, preferably a replicable vector, 
comprising an inducible promoter operatively linked to a nucleotide 
sequence provided by the present invention. 

The term "inducible" as applied to a promoter is well understood by 
those skilled in the art. In essence, expression under the control 
of an inducible promoter is "switched on" or increased in response 
to an applied stimulus. The nature of the stimulus varies between 
promoters. Some inducible promoters cause little or undetectable 
levels of expression (or no expression) in the absence of the 
appropriate stimulus. Other inducible promoters cause detectable 
constitutive expression in the absence of the stimulus. Whatever 
the level of expression is in the absence of the stimulus, 
expression from any inducible promoter is increased in the presence 
of the correct stimulus. 

Particular of interest in the present context are nucleic acid 
constructs which operate as plant vectors. Specific procedures and 
vectors previously used with wide success upon plants are described 
by Guerineau and Mullineaux (1993) (Plant transformation and 
expression vectors. In: Plant Molecular Biology Labfax (Croy RRD 
ed) Oxford, BIOS Scientific Publishers, pp 121-148) . 

Suitable promoters which operate in plants include the Cauliflower 
Mosaic Virus 35S (CaMV 35S) . Other examples are disclosed at pg 
120 of Lindsey & Jones (198 9) "Plant Biotechnology in Agriculture" 
Pub. OU Press, Milton Keynes, UK. The promoter may be selected to 
include one or more sequence motifs or elements conferring 
developmental and/or tissue-specific regulatory control of 
expression. Inducible plant promoters include the ethanol induced 
promoter of Caddick et al (1998) Nature Biotechnology 16: 177-180. 
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If desired, selectable genetic markers may be included in the 
construct, such as those that confer selectable phenotypes such as 
resistance to antibiotics or herbicides (e.g. kanamycin, 
hygromycin, phosphinotricin, chlorsulf uron, methotrexate, 
gentamycin, spectinomycin, imidazolinones and glyphosate) . 

The present invention also provides methods comprising introduction 
of such a construct into a host cell, particularly a plant cell. 

In a further aspect of the invention, there is disclosed, a host 
cell containing a heterologous nucleic acid or construct according 
to the present invention, especially a plant or a microbial cell. 

The term "heterologous" is used broadly in this aspect to indicate 
that the SUSIBA2 nucleic acid in question has been introduced into 
said cells of the plant or an ancestor thereof, using genetic 
engineering, i.e. by human intervention. A heterologous gene may 
replace an endogenous equivalent gene, i.e. one which normally 
performs the same or a similar function, or the inserted sequence 
may be additional to the endogenous gene or other sequence. 

Nucleic acid heterologous to a plant cell may be non-naturally 
occurring in cells of that type, variety or species. Thus the 
heterologous nucleic acid may comprise a coding sequence of or 
derived from a particular type of plant cell or species or variety 
of plant, placed within the context of a plant cell of a different 
type or species or variety of plant. A further possibility is for 
a nucleic acid sequence to be placed within a cell in which it or a 
homolog is found naturally, but wherein the nucleic acid sequence 
is linked and/or adjacent to nucleic acid which does not occur 
naturally within the cell, or cells of that type or species or 
variety of plant, such as operably linked to one or more regulatory 
sequences, such as a promoter sequence, for control of expression. 

The host cell (e.g. plant cell) is preferably transformed by the 
construct, which is to say that the construct becomes established 
within the cell, altering one or more of the cell's characteristics 
and hence phenotype e.g. with respect to sugar sensing or starch 
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anabolism. 

Nucleic acid can be transformed into plant cells using any suitable 
technology, such as a disarmed Ti-plasmid vector carried by 
Agrobacterium exploiting its natural gene transfer ability (EP-A- 
270355, EP-A-0116718, NAR 12(22) 8711 - 87215 1984),. particle or 
microprojectile bombardment (US 5100792, EP-A-444882, EP-A-434616) 
microinjection (WO 92/09696, WO 94/00583, EP 331083, EP 175966, 
Green et al. (1987) Plant Tissue and Cell Culture, Academic Press), 
electroporation (EP 290395, WO 8706614 Gelvin Debeyser) other forms 
of direct DNA uptake (DE 4005152, WO 9012096, US 4684611), liposome 
mediated DNA uptake (e.g. Freeman et al. Plant Cell Physiol. 29: 
1353 (1984)), or the vortexing method (e.g. Kindle, PNAS U.S.A. 87: 
1228 (1990d) Physical methods f or . the transformation of plant cells 
are reviewed in Oard, 1991, Biotech. Adv. 9: 1-11. 

Agrobacterium transformation is widely used by those skilled in- the 
art to transform dicotyledonous species. Recently, there has also 
been substantial progress towards the routine production of stable, 
fertile transgenic plants in almost all economically relevant 
monocot plants (see e.g. Hiei et al. (1994) The Plant Journal 6, 
271-282) ) . Microprojectile bombardment, electroporation and direct 
DNA uptake are preferred where Agrobacterium alone is inefficient 
or ineffective. Alternatively, a combination of different 
techniques may be employed to enhance the efficiency of the 
transformation process, eg bombardment with Agrobacterium coated 
microparticles (EP-A-48 6234 ) or microprojectile bombardment to 
induce wounding followed by co-cultivation with Agrobacterium (EP- 
A-486233) . 

The particular choice of a transformation technology will be 
determined by its efficiency to transform certain plant species as 
well as the experience and preference of the person practising the 
invention with a particular methodology of choice. It will be 
apparent to the skilled person that the particular choice of a 
transformation system to introduce nucleic acid into plant cells is 
not essential to or a limitation of the invention, nor is the 
choice of technique for plant regeneration. 



Thus a further aspect of the present invention provides a method of 
transforming a plant cell involving introduction of a construct as 
described above into a plant cell and causing or allowing 
recombination between the vector and the plant cell genome to 
introduce a nucleic acid according to the present invention into 
the genome. 

The invention further encompasses a host cell transformed with 
nucleic acid or a vector according to the present invention 
especially a plant or a microbial cell. In the transgenic plant 
cell (i.e. transgenic for the nucleic acid in question) the 
transgene may be on an extra-genomic vector or incorporated,, 
preferably stably, into the genome. There may be more than one 
heterologous nucleotide sequence per haploid genome. 

Generally speaking, following transformation, a plant may be 
regenerated, e.g. from single cells, callus tissue or leaf discs, 
as is standard in the art. Almost any plant can be entirely 
regenerated from cells, tissues and organs of the plant. Available 
techniques are reviewd in Vasil et al . , Cell Culture and Somatic 
Cell Genetics of Plants, Vol I, II and III , Laboratory Procedures 
and Their Applications , Academic Press, 1984, and Weissbach and 
Weissbach, Methods for Plant Molecular Biology, Academic Press, 
1989. 

The generation of fertile transgenic plants has been achieved in 
the cereals rice, maize, wheat, oat, and barley (reviewed in 
Shimamoto, K. (1994) Current Opinion in Biotechnology 5, 158-162./ 
Vasil, et al. (1992) Bio/ Technology 10, 667-674; Vain et al . , 1995, 
Biotechnology Advances 13 (4): 653-671; Vasil, 1996, Nature 
Biotechnology 14 page 702) . 

Plants which include a plant cell according to the invention are 
also provided. 

In addition to the regenerated plant obtainable by the above 
method, the present invention embraces all of the following: a 
clone of such a plant; selfed or hybrid progeny; descendants (e.g. 
Fl and F2 descendants) and any part of any of these. The invention 



also provides a plant propagule from such plants, that is any part 
which may be used in reproduction or propagation, sexual or 
asexual, including cuttings, and so on. In each case these 
embodiments will include a heterologous SUSIBA2 nucleic acid 
according to the present invention. 

Uses of SUSIBA2 nucleic acids and transcription factors 

The present invention provides a method of influencing the starch 
anabolism in a plant, the method including the step of causing or 
allowing expression of the product (polypeptide or nucleic acid 
transcript) encoded by heterologous nucleic acid according to the 
invention from that nucleic acid within cells of the plant. 

The step may be preceded by the earlier step of introduction of the 
nucleic acid into a cell of the plant or an ancestor thereof. 

Also provided is a method of binding, activating, or identifying a 
promoter which includes at least one SURE element and\or W box 
element, which method employs the step of contacting said promoter 
with a transcription factor as described herein. Where such method 
is performed in vivo, it will be a heterologous transcription 
factor. 

Activation of a promoter may be confirmed by use of a reporter gene 
e.g. as described in the Examples herein. 

Also provided is a method of identifying a sugar responsive element 
in a promoter, which method employs the step of contacting said 
promoter with a transcription factor as described herein. Where 
such method is performed in vivo, it will generally be a 
heterologous transcription factor. The contacting step is 
optionally followed by detection of binding to the promoter or 
region of the promoter. For example, elements within promoters may 
be characterised by use of EMS A or DNAase foot-printing techniques 
e.g. as described in the Examples herein. 

The foregoing discussion has been generally concerned with uses of 
the nucleic acids of the present invention for production of 



functional SUSIBA2 polypeptides in a plant. 



However the information disclosed herein may also be used to reduce 
the activity or levels of such polypeptides in cells in which it is 
desired to do so. For instance the sequence information disclosed 
herein may be used for the down-regulation of expression of genes 
e.g. using anti-sense technology (see e.g. Bourque, (1995) , Plant 
Science 105, 125-149); sense regulation [co-suppression] (see e.g. 
Zhang et al., (1992) The Plant Cell 4, 1575-1588). Further options 
for down regulation of gene expression include the use of 
ribozymes, e.g. hammerhead ribozymes, which can catalyse the site- 
specific cleavage of RNA, such as mRNA (see e.g. Jaeger (1997) "The 
new world of ribozymes" Curr Opin Struct Biol 7:324-335. Nucleic 
acids and associated methodologies for carrying out down-regulation 
(e.g. complementary sequences) form one part of the present 
invention . 

The present invention also encompasses the expression product of 
any of the functional nucleic acid sequences disclosed above, plus 
also methods of making the expression product by expression from 
encoding nucleic acid therefore under suitable conditions , which 
may be in suitable host cells. 

Following expression, the recombinant product may, if required, be 
isolated from the expression system. Generally however the 
polypeptides of the present invention will be used in vivo (in 
particular in planta) . 

Modified starches 

The present inventors have used confirmed the importance of SUSIBA2 
in regulation of starch synthesis by use of antisense 
oligodeoxynucleotide (ODN) technology (Dagle, J.M. & Weeks, D.L. 
Differentiation 69, 75-82) . The ectopic expression of isol and 
sbellb was followed in intersected sucrose -treated barley leaves 
after addition of an 18-mer antisense ODN against suseba2 . RNA- gel 
blot analysis revealed that exposure to the antisense ODN 
suppressed isol and sJbeJXb expression, providing functional 
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evidence that SUSIBA2 is relevant for isol and sJbeJXJb activities. 
Additionally the inventors investigated the effects of the 
antisense ODN technology on starch composition. The antisense 
suseba2 ODNs lead to a significant increase in iodine staining, 
which is indicative of fewer or shorter branches in the starch 
molecules, consistent with reduced sbellb and isol activities. Also 
the solubility properties of the starch were altered, as judged by 
increased turbidity (results not shown) . 

Thus a further aspect of the present invention is a polysaccharide 
generated {in vivo) by a process comprising manipulation of a 
SUSIBA2 transcription factor as described above. Also embraced is 
starch produced in the transformed plants and cells discussed 
above. Such starch is preferably derived from amylopectin but has 
any of a decreased, increased or otherwise altered degree of 
branching, with a corresponding alteration in properties e.g. 
solubility, swelling or ability to form a paste rather than a gel 
when heated in water. Commodities (e.g. foodstuffs) comprising 
such starches form a further aspect of the present invention. 

Other commodities which may benefit from the modified starches of 
the present invention include biodegradable plastics; food- 
processing thickeners; starch coated films, papers & textiles; 
paint thickeners; mining explosives; pharmaceuticals and glues. The 
modified starches can be used analogously to prior art starches in 
these materials, in ways which are well known to those skilled in 
the respective technical fields. 

SUSIBA2 antibodies 

Purified SUSIBA2 protein produced recombinantly by expression from 
encoding nucleic acid therefor, may be used to raise antibodies 
employing techniques which are standard in the art. Antibodies and 
polypeptides comprising antigen-binding fragments of antibodies 
form a further part of the present invention, and may be used in 
identifying homologues from other plant species. 

Methods of producing antibodies include immunising a mammal (e.g. 



mouse, rat, rabbit, horse, goat, sheep or monkey) with the protein 
or a fragment thereof- Antibodies may be obtained from immunised 
animals using any of a variety of techniques known in the art, and 
might be screened, preferably using binding of antibody to antigen 
of interest. 

For instance, Western blotting techniques or immunoprecipitation 
may be used (Armitage et al, 1992, Nature 357: 80-82). Antibodies 
may be polyclonal or monoclonal. 

Antibodies may be modified in a number of ways. Indeed the term 
"antibody" should be construed as covering any polypeptide having a 
binding domain with the required SUSIBA2 specificity. Thus, this 
term covers antibody fragments, derivatives, functional equivalents 
and homologues of antibodies, including any polypeptide comprising 
an immunoglobulin binding domain, whether natural or synthetic. 
Chimaeric molecules comprising an immunoglobulin binding domain, or 
equivalent, fused to another polypeptide are therefore included. 
Cloning and expression of Chimaeric antibodies are described in EP- 
A-0120694 and EP-A-0125023 . It has been shown that fragments of a 
whole antibody can perform the function of binding antigens. 
Examples of binding fragments are (i) the Fab fragment consisting 
of VL, VH, CL and CHI domains; (ii) the Fd fragment consisting of 
the VH and CHI domains; (iii) the Fv fragment consisting of the VI 
and VH domains of a single antibody; (iv) the dAb fragment (Ward, 
E.S. et al., Nature 341, 544-546 (1989) which consists of a VH 
domain; (v) isolated CDR regions; (vi) F(ab')2 fragments, a 
bivalent fragment comprising two linked Fab fragments (vii) single 
chain Fv molecules (scFv), wherein a VH domain and a VL domain are 
linked by a peptide linker which allows the two domains to 
associate to form an antigen binding site (Bird et al, .Science, 
242, 423-426, 1988; Huston et al, PNAS USA, 85, 5879-5883, 1988); 
(viii) bispecific single chain Fv dimers (PCT/US92/09965) and (ix) 
"diabodies", multivalent or multispecif ic fragments constructed by 
gene fusion (WO94/13804; P Holliger et al Proc. Natl. Acad. Sci. 
USA 90 6444-6448, 1993). 

Candidate SUSIBA2 polypeptides may be screened using these 
antibodies - e.g. by screening the products of an expression 
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library created using nucleic acid derived from an plant of 
interest, or the products of a purification process from a natural 
source. A polypeptide found to bind the antibody may be isolated 
and then may be subject to amino acid sequencing. Any suitable 
technique may be used to sequence the polypeptide- either wholly or 
partially (for instance a fragment of the polypeptide may be 
sequenced) . Amino acid sequence information may be used in 
obtaining nucleic acid encoding the polypeptide, for instance by 
designing one or more oligonucleotides (e.g. a degenerate pool of 
oligonucleotides) for use as probes or primers in hybridization to 
candidate nucleic acid, or by searching computer sequence 
databases, as discussed further below. 

The invention will now be further described with reference to the 
following non-limiting Figures and Examples. Other embodiments of 
the invention will occur to those skilled in the art in the light 
of these. 

FIGURES 

Figure 1. Structure of the suseba2 cDNA from barley. 

The 18-nucleotides long sequence complementary to the human IGFl 
receptor gene is underlined. The amino acid sequence of the tryptic 
fragment obtained by microsequencing from the overproduced SUSIBA2 
(see Figure 3) is shown by an open box. 

Figure 2. Analysis of the SUSIBA2 Primary Structure. 

The amino acid sequence of SUSIBA2 was aligned with those of the 
nine closest matching proteins from a BLAST search. Identical amino 
acids are shown in back boxes, and similar amino acids are shown in 
grey boxes. The WRKYGQK peptide stretch is shown in pink. Putative 
nuclear localization signals are indicated with an orange # sign 
under the sequences. Ser- and Thr-rich regions are underlined in 
red. The zinc-finger-like motifs in the two WRKY domains are 
indicated with green vertical arrows under the sequence. A. t. hyp, 
Arabidopsis hypothetical protein; A. t. put, Arabidopsis putative 
protein; A. t. unk, Arabidopsis unknown protein; A. t. wrky20, 



Arabidopsis WRKY transcription factor 20; H. v. SUSIBA2, barley 
SUSIBA2; I. b. SPF1, sweet potato SPFl; L. e. hyp, tomato 
hypothetical proteins; N. t. wrky, tobacoo WRKY protein; R. r. d-i, 
white broom drought-induced protein. 

Figure 3. Purification of SUSIBA2 . 

His-tagged SUSIBA2 was subjected to FPLC purification on a Ni 
affinity column following overexpression of the susiba2 cDNA 
construct in E. coll. Amino acid sequences obtained by 
microsequencing of tryptic fragments are indicated. 

Figure 4. Binding of SUSIBA2 to Restriction Fragments Containing 
the SP8a Element. 

(A) A portion of the isol promoter and restriction fragments 
spanning the SP8a element and flanking sequences. 

(B) EMSA using the different SP8a-containing restriction fragments 
incubated in the presence (+) or absence (-) of SUSIBA2 . The DNA- 
SUSIBA2 complexes are indicated by arrows. 

Figure 5. DNase I Footprint of SUSIBA2 to the isol Promoter Region. 

(A) The DNA fragment (-736 to -537) was end-labeled and applied to 
Maxam-Gilbert sequencing. The sequencing reaction was used as 
markers (M) . DNAse I digestion was performed in presence (+) or 
absence (-) of SUSIBA2 . Positions, relative to the transcriptional 
start site, in the Isol promoter are shown on the left. The 
protected region (-603 to -545) is indicated by an open box. 
Regions corresponding the SURE-a, SURE-b, and SURE-c sequences are 
designated as a, b, and c, respectively. 

(B) Alignments of sequences from the protected region with the SURE 
element from the potato patatin promoter (Grierson et al., 1994) . 

Figure 6. Binding of SUSIBA2 to Oligonucleotides Containing the 
SP8a, SURE, or W box Elements. 

(A) The sequence of the three oligonucleotides. The SP8a, SURE and 



26 

W box elements are shown in bold. 

(B) EMS A using the different oligonucleotides incubated in the 
presence ( + ) or absence (-) of SUSIBA2 . The DNA-SUSIBA2 complexes 
are indicated by arrows. 

Figure 7. Binding Competition Assay for SUSIBA2 with 
Oligonucleotides Containing the SURE or W box elements. 

EMSA using the SURE or W box oligonucleotides incubated in the 
presence ( + ) or absence (-) of SUSIBA2 r and without (-) or with 
(SURE or W box) a 100-fold molar excess of non-labeled competitor 
oligonucleotides . 

Figure 8. Temporal Expression Profile for susiba2 and isol. 

(A) RNA gel blot analysis of susiba2 and isol gene activity during 
barley endosperm development. Total RNA was isolated from barley 
endosperm on different days after pollination (d.a.p.) and used for 
RNA gel blot analysis. The blots were hybridized with probes 
specific for susiba2 t Lsol, or pazl. The sizes for the hybridizing 
fragments were around 2 . 4 kb for the sus±ba2 probe, 2.8 kb f or ' isol 
probe, and 1.5 kb for the pazl probe. 

(B) TLC analysis of endogenous sucrose levels in developing barley 
endosperm from different d. a. p. A sucrose extract from 0.2 mg dry 
weight of developing endosperm from each time point was applied to 
TLC analysis. 

Figure 9. Ectopic Expression of susiba2 and isol in Barley Leaves. 

Barley leaves were treated with exogenous sucrose at the 
concentrations indicated. Total RNA was isolated and hybridized to 
probes specific for suslba2 f isol, rbcS, and rbcL. 

Figure 10. Effects of the SURE-b ODN Decoy on isol Expression in 
Transformed Barley Endosperm. 



(A) Schematic representation of the isolngfp construct (upper 
panel) and the SURE-b ODN decoy (lower panel) • 



(B) Transient expression assays of GFP fluorescence in transformed 
barley endosperm cells. Barley endosperm cells were transformed by 
micropro jectile bombardment in absence {-) or presence ( + ) of the 
ODN decoy. For cotrans formation, the Isolngfp construct and ODN 
decoy were mixed at a molar ratio of 1:100. 

(C) Average number of fluorescent foci per plate from nine 
independent transformation events. 

Figure 11. Primary Structure Comparison between SUSIBA2 , the Rice 
and Wheat SUSIBA2 Orthologs, and SPF1 . 

(A) Alignment of the amino acid sequences between the conserved 
WRKY domains of SUSIBA2, the Rice and Wheat SUSIBA2 Orthologs, and 
SPF1. 

(B) Alignment of the entire amino acid sequence of SUSIBA2 with 
that of the rice SUSIBA2 ortholog. The portion corresponding to the 
isolated rice cDNA clone is denoted with a hatched double line 
under the sequences, with green indicating the aligned regions in 

(A). The "SUSIBA2-specific" sequence (c.f. Figure 2) is indicated 
by a stretch of blue # signs. 

Figure 12. Schematic Comparison of SUSIBA2 and SUSIBAl . 
SUSIBA1 and the C-terminal half of SUSIBA2 share > 95% amino acid 
identity over most of their lengths. The exception is the divergent 
N-terminus of SUSIBAl. The conserved WRKY domains and the nuclear 
localization signals (NLS) are indicated. 

Figure 13. Model of the Interactions between the SUSIBAs and the 
isol Promoter. 

SUSIBA2-70 binds to the SURE and W box elements, whereas SUSIBA2-68 
binds only to the W box. Neither of the SUSIBA2 forms binds to the 
SP8a element. Note that we do not know whether SUSIBA2-68 exists in 
vivo. 

Figure 14. SURE and SURE-like sequences in plant promoters. 
Alignment of SURE sequences in promoters of plant genes known to be 
subjected to sugar induced expression. Selected A and T nucleotides 
are highlighted in red and green boxes, respectively. SURE-c, SURE- 
b, and SURE-a refer to the SURE sequences of barley isol (this 
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work); sbellb, barley sbellb (Sun et al., 1998); ssl, barley ssl 
(Gubler et al., 2000); agpaseS, barley agpaseS (Thorbj0msen et 
al., 2000); amy, Arabidopsis gene for B-amylase (Mita et al., 
1995); sbel, maize sbel (Kim and Guiltinan, 1999); sus4 f potato 
gene for sucrose synthase (Fu et al., 1995); vsp, soybean gene for 
vegetative storage protein (Rhee and Staswick, 1992); PI-II, potato 
gene for proteinase inhibitor II (Kim et al., 1991); ps20, potato 
patatin class I gene (Grierson et al., 1994). 



EXAMPLES 

Example 1 - Isolation and Characterization of the suslba2 cDNA 

Primers were designed for RT-PCR amplification of barley cDNA using 
total RNA isolated from developing barley endosperm 9 days after 
pollination (d.a.p.). Primers combinaed information from sweet 
potato, cucumber and parsley transcription factor sequences 
(Ishiguro and Nakamura, 1994; Kim et al . , 1997; Rushton et al., 
1996) . Surprisingly, one PCR product of an appropriate size was 
obtained (data not shown) . This is particularly unexpected since 
given that, for instance, there are a large number (>70) of WRKY 
genes in Arabidopsis, and also that WRKY domains are closely 
conserved. The choice of developing endosperms may have 
contributed to there only being a single PCR product. 

The amplified product was used as a probe to screen a 10 d.a.p. 
endosperm cDNA library and a leaf library. From screening of 1 x 
10 6 p.f.u. 14 positive clones were recovered. All positive clones 
were subjected to subcloning, restriction mapping and sequencing 
analysis . 

One clone was completely sequenced on both DNA strands. The length 
of the sus±ba2 cDNA is 2355 nucleotides, and the open reading frame 
starts at position 247 and ends at position 1966 (Figure 1) . 

The cDNA encodes a 573-amino acids long polypeptide, SUSIBA2 , with 
a calculated molecular mass of 62.2 kDa, and it has 5' and 3'- 
untranslated regions and a poly (A + ) tail sequences. These 
findings, together with the results from RNA gel blot analyses (see 



below), suggest that the susiba2 cDNA is full-length. One 
interesting feature of sus±ba2 is that an 18-nucleotides long 
sequence between nucleotides 1051 and 1068 (Figure 1) is 100% 
complementary to the gene for human insulin-like growth factor 1 
(IGF1) receptor (nucleotides 173-190) . 

The deduced amino acid sequence of SOSIBA2 was used in a BLAST 
search of the NCBI databases. Alignment of the 9 most top scoring 
matches with the SUSIBA2 sequence demonstrated that all ten 
proteins share a high degree of similarity and that SUSIBA2 belongs 
to group 1 of the WRKY super family of plant transcription factors 
(Figure 2) . 

As would be expected from transcription factors, WRKY proteins 
contain nuclear localization signals (NLSs) . The two most common 
NLSs are monopartite signals, which are short stretches enriched in 
basic amino acids, and bipartite signals, which are composed of two 
short basic stretches separated by a spacer (Merkle, 2001) . The 
monopartite NLS depicted for WRKY proteins is conserved in SUSIBA2 
(amino acids 328-331; Figure 2) . However, a closer inspection of 
the sequences suggests that the monopartite NLS in SUSIBA2 and 
other WRKY proteins might also constitute a bipartite NLS close to 
the consensus, KR- (24-74)— R. .RK, for early auxin-inducible proteins 
(Abel and Theologis, 1995) . One putative bipartite NLS in SUSIBA2 
is KR-(66)-RK (amino acids 328-397). The other possibility is KR- 
(37)-R.RK (amino acids 328-370) and involves the WRKYGQK sequence 
itself in the C-terminal WRKY domain. The function of the WRKYGQK 
heptapeptide is unknown and it cannot be ruled out that it 
participates in the nuclear localization process, although a 
bipartite NLS arrangement for the N-terminal WRKYGQK sequence is 
not found. The most distinguishing feature of SUSIBA2 relative the 
other WRKY proteins is the 37-amino acids long insertion following 
the C-terminal WRKY- domain (amino acids 435-471; Figure 2) . A BLAST 
search with the. insertion sequence did not show any matches in the 
NCBI databases (however, see below) , suggesting that SUSIBA2 is a 
novel type of WRKY protein. 

The two WRKY domains (Eulgem et al., 2000) with their zinc finger 
motifs are highly conserved in SUSIBA2. Also the N-terminal Ser- 



and Thr-rich regions that are present in many WRKY proteins are 
found in SUSIBA2. The function of these regions is not known but is 
likely to involve regulation. Ser- and Thr-rich regions are 
characteristic of activation domains in transcription factors. It 
is generally believed that the C-terminal WRKY domain mediates 
sequence-specific binding to the cognate DNA elements, whereas the 
N-terminal WRKY domain facilitates DNA binding or engages in 
protein-protein interactions (Eulgem et al . , 2000). The zinc 
fingers in each WRKY domain might be involved in binding to either 
DNA or proteins . 

The WRKY class of proteins seem to be specific for the plant 
kingdom and are best known for their participation in various 
stress responses and senescence (Eulgem et al., 2000). Our results 
demonstrate that their sphere of activities has to be expanded to 
also include carbohydrate anabolism. It is probable that the 
unique properties of SUSIBA2 , as compared to other, hitherto known, 
WRKY proteins, are attributable to regions outside of the two WRKY 
domains, where the amino acid sequences are relatively divergent 
(Figure 2) . 

Example 2 - Overproduction of SUSIBA2 in E. coll 

The susiba2 cDNA was inserted into a pET expression vector and 
overexpressed in E. coll after IPTG induction. The overproduced 
protein was purified by a one-step FPLC on a His-Tag affinity 
column. Results from a typical overproduction and purification 
procedure are shown in Figure 3. 

The purified polypeptides were used for microsequencing, activity 
characterization and antibody production. 

The purified protein fraction contained three polypeptides. They 
were separated on SDS-PAGE ^nd subjected to in situ trypsin 
digestion. The digested fragments from each polypeptide were 
separated on HPLC and applied to mass-spectrometry analysis. The 
upper two bands corresponded to polypeptides with molecular masses 
of around 70 kDa and 68 kDa, respectively, and with the identical 
sequence LAGGAVP* This sequence is found in the deduced amino acid 



sequence of SUSIBA2 (Figure 1; amino acids 235-241) . The lower band 
corresponded to a 28-kDa polypeptide with the sequence of EIAKQA. 
This sequence matches E. coll trehalose-6-p synthase (amino acids 
234-239) . This enzyme copurified with the SUSIBA2 polypeptides also 
during repeated runs on the Ni column. It is therefore conceivable 
that the copurif ication was due to an unspecific association 
between SUSIBA2 and E. coll trehalose 6-P synthase, which may 
reflect an interaction of biological significance between SUSIBA2 
and barley trehalose 6-P synthase. A role for trehalose and 
trehalose- 6-phosphate in plant sugar signaling has been proposed 
(Goddijn and Smeekens, 1998). 

Thus purified SUSIBA2 appeared in two forms, SUSIBA2-7 0 and 
SUSIBA2-68, of approximately 70 and 68 kDa, respectively (Figure 
3). Possibly, SUSIBA2-68 is a truncated version of SUSIBA2-70. This 
notion is supported by preliminary results, which indicate that the 
level of SUSIBA-70 increases at the expense of SUSIBA2-68 with 
increasing concentrations of protease inhibitors . 

Since the purified SUSIBA2 still contains the N-terminal His-Tag, 
it seems less likely that any truncation should be at the N- 
terminal as that would leave too few His on SUSIBA2-68 to bind to 
the Ni column. Alternatively, the size difference between SUSIBA2- 
70 and SUSIBA-68 is an effect of posttranslational modification. 
The two SUSIBA2 polypeptides were extracted and used for production 
of polyclonal antibodies. 

Example 3 - Characterization of SUSIBA2 DNA-Binding Activity 

As an initial test of the DNA-binding activity of SUSIBA2, we 
performed electrophoresis mobility shift assays (EMSAs) with three 
overlapping nucleotide fragments from the barley Isol promoter, all 
containing the SP8a sequence (Figure 4) . 

The 822-bp long Sphl/Haell fragment, encompassing SP8a and flanking 
sequences, as well as the 437-bp long BspHI/Haell fragment, 
containing SP8a and downstream flanking sequence, formed prominent 
DNA-protein complexes with the purified SUSIBA2 fraction. Much to 
our surprise, however, the 510-bp long Sphl/Bsal fragment, 
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containing SP8a and upstream flanking sequence, bound only very- 
poorly to SUSIBA2. We concluded that, in fact, the target for 
SUSIBA2 in the isol promoter is not SP8a but one or more sites 
downstream of the SP8a element- From further analysis of the 
segment downstream of SP8a in the BspHI/Haell fragment we found 
that it harbors the sequence AATACCAAAAAATAATAATAAAA (nucleotides - 
568 to -545, relative the transcriptinal start site), which shares 
a high degree of identity with the SURE (sugar responsive) element, 
first reported by Grierson et al. (1994) from work on the potato 
class 1 patatin promoter. To follow up on this observation, we 
carried out EMS As with oligonucleotides consisting of the isol SURE 
sequence. SUSIBA2 exhibited strong binding activity with the SURE 
probe, whereas binding to the SP8a probe was negligible (Figure 6) . 

Since SUSIBA2 is a WRKY protein, it would be expected also to bind 
to the W box, the highly conserved binding site for WRKY proteins 
(Eulgem et al . , 2000) . A sequence identical to the consensus 
sequence for the W box, TGACT (nucleotides -400 to -396) , was 
located in the isol promoter, and the isol W box sequence was 
included in the binding assay. With the W box probe, two DNA- 
SUSIBA2 complexes could be discerned. Thus, the results supported 
our earlier unexpected finding that SUSIBA2 does not bind to the 
SP8a element, and showed that, instead, it interacts with the SURE 
and W box elements. 

To further illustrate that SUSIBA2 binds to the SURE element in the 
isol promoter, we ran a DNasel footprinting assay. A 210-bp long 
Nrul/Xbal fragment, encompassing the SURE element, was incubated 
with purified SUSIBA2, followed by DNasel digestion (Figure 5) . 

As expected, the protected region covered the SURE element but also 
the upstream and downstream flanking sequences were protected. In 
searching for an explanation we found that two additional SURE-like 
sequences, one immediately upstream, CCGAAAAAAACTAAGAAAGACCGATG 
(nucleotides -603 to -578) and one immediately downstream, 
AAAAAAT AAAGAAAAT GAAAT C (nucleotides -514 to -493), of the one 
located initially, are present in the isol promoter. We refer to 
these SURE sequences as SURE-a, SURE-b and SURE-c for the upstream, 
middle and downstream sequences, respectively. A conceivable 



33 



scenario is that SUSIBA2 binds to all three SURE sequences, 
yielding a protected segment of >110 nucleotides. This would likely 
explain why the protection continued beyond the 3 r end of the 
restriction fragment (Figure 5A) . As is also suggested by the 
footprinting assay, the SUSIBA2-DNA interaction is stronger for 
SURE-b than for SURE-a. An attempt to carry out a footprinting 
assay with the 437-bp long BspHI/Haell fragment (Figure 4A) , 
covering all three SURE sequences, produced poor quality results. 
This is not surprising since the recommended restriction fragment 
lengths for DNase I footprinting is 250 bp or less. 

Having demonstrated that SUSIBA2 binds to the SURE and W box 
elements in the isol promoter, we were interested in studying the 
relationship between the two binding activities. To this end we 
performed a competition experiment where the same amount of SUSIBA2 
was incubated with the SURE or W box probe (see Figure 6), in the 
presence or absence of a 100-fold molar excess of non-labeled 
competitor. The EMSA revealed that the W box competitor could 
efficiently outcompete the SURE probe from SUSIBA2 , and that the 
SURE competitor could outcompete the W box probe from the larger of 
the two SUSIBA2 complexes (Figure 7) . It is logical to assume that 
the large and small complex contains the SUSIBA2-70 and SUSIBA2-68 
polypeptides, respectively. Therefore, we interpret these results 
to mean that SUSIBA2-70 binds to both the W box and the SURE 
element, whereas SUSIBA2-68 binds only to the W box. 

A model depicting the binding properties of SUSIBA2 with respect to 
the isol promoter is shown in Figure 13. As has been demonstrated 
above, purified SUSIBA2 binds strongly to the SURE element and the 
W box but only poorly to the SP8a motif. Binding to the W box is 
consistent with SUSIBA2 being a WRKY protein, whereas binding to 
the SURE element displays a novel feature for a WRKY protein. The 
results herein suggest that the extreme C-terminal may be required 
for SURE-specific interactions. Another region that merits 
attention is the insertion in SUSIBA2 right after the C-terminal 
WRKY domain (Figures 2 and 11) . 

Example 4 - Expression of susiba2 and isol correlates with Sucrose 
Levels 
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In previous investigations we have shown that expression of the 
barley isol gene is sugar inducible (Sun et al., 1999). In light of 
the different reports pointing to the involvement of the SURE 
element in sugar signaling (Fu et al., 1995; Grierson et al., 1994; 
Kim and Guiltinan, 1999; Kim et al., 1991; Li et al, 2002; Mita et 
al., 1995; Rhee and Staswick, 1992), and the results here, showing 
the SURE-binding activity of the SUSIBA2 transcription factor, it 
was of interest to study the expression pattern of the suslba2 
gene. For examination of the temporal expression, barley endosperms 
were collected 7 to 27 d.a.p. and total RNA was isolated. RNA gel 
blot analysis was performed with susiba2- or isol-specif ic probes, 
or with a PCR-amplif ied probe for the pazl gene, encoding the 
storage protein Z (S0rensen et al . , 1989), which was included for 
comparison. The expression level for susiba2 was low compared to 
isol but the patterns were similar, with a peak at around 12 d.a.p. 
(Figure 8A) . 

A similar time course was obtained also for the barley sbella and 
sbellb genes (Sun et al., 1998). Expression of the pazl gene 
increased up to or beyond 22 d.a.p. and then sharply declined, 
consistent with the results by S0rensen et al . (1989). To study the 
correlation between sucrose and susiba2 expression, we monitored 
the endogenous sucrose concentrations during endosperm development 
by TLC. The sucrose concentrations were determined in seeds from 
the same batches used for transcript analysis. A comparison between 
the temporal expression pattern for susiba2 and Isol transcript 
accumulation and sucrose levels showed a very good correlation 
(Figure 8B) . 

No susiba2 transcripts could be detected in normal barley leaves . 
However, ectopic sus±ba2 expression in leaves was achieved after 
supplying exogenous sugar (Figure 9A) . The SUSIBA2 which was 
produced in sugar-treated leaves where it could be localized to the 
nucleus (data not shown) . 

Ectopic expression of isol, which agrees with previous results 
(Sun et al., 1999), was also achieved. As a comparison, the 
photosynthesis genes rbcS and rbcL, encoding, respectively, the 
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small and large subunit of RUBISCO, were abundantly expressed in 
control leaves (0 mM sucrose) . The nuclear rbcS was dramatically 
downregulated by sucrose while the plastidic rbcL was not notably 
affected. The expression studies were extended to the protein 
level, using the SUSIBA2 antibody in a protein gel blot analysis 
(Figure 9B) . The SUSIBA2 protein was found in endosperm but not in 
leaves . 

The data from the expression analyses is consistent with a role for 
SUSIBA2 as a positive regulatory transcription factor for isol 
activity. Experimental evidence for this view has recently been 
obtained by antisense inhibition studies (results to be presented 
elsewhere) . They also suggest that the expression of the sus±ba2 
gene itself is controlled via sugar signaling. 

From previous work, we learned that SP8a in the isol promoter is a 
negative regulatory element that recruits a repressor in non- 
expressing organs, such as embryos and leaves (Sun et al . , 1999). 
The data from the expression analyses here, and the affinity of 
SUSIBA2 for the SURE element, together strongly suggest that 
SUSIBA2 binds to the SURE element (s) in the isol promoter as a 
positive regulator. 

Example 5 - In vivo Analysis of Transformed Barley Endosperm 
Demonstrate the Relevance of the SURE Element for isol Promoter 
Activity 

To experimentally verify the premise that the SURE-b serves as an 
activating element for isol expression, we carried out transient 
assays in which the transcription of a chimeric isol-reporter 
construct was studied in transformed barley endosperm. In this 
experiment, the entire (1.5 kb) isol promoter was fused to the gfp 
reporter gene. The employment of the GFP reporter system for 
analyses of promoter activity has been successfully used in 
transformed barley embryos (Ahlandsberg et al . , 1999, 2002b). Since 
the isol gene is not expressed in embryos (Sun et al., 1999), a 
protocol for transformation of barley endosperm was established 
allowing us to carry out transient assays with gfp reporter 
constructs also in transformed barley endosperm. 



The activity of the isol promoter, manifested as GFP fluorescence, 
was followed in transformed barley endosperm in the presence or 
absence of a transcription factor oligodeoxynucleotide (ODN) decoy 
containing the SURE-b sequence (Figure 10A) . 

The ODN decoy approach involves flooding the cells with enough 
double-stranded decoy to compete for binding of transcription 
factors with their consensus sequences in target genes. If present 
in high-enough concentrations, these decoys can negate the ability 
of the transcription factor to regulate gene expression. The 
transcription factor ODN technology is widely used in medical 
investigations (Mann and Dzau, 2000) . Here, we choose to adopt the 
strategy for our studies on barley endosperm. As is demonstrated in 
Figure 10B, the presence of the SURE-b decoy in the endosperm cells 
blocked most, if not all, isol promoter activity. As shown in this 
Figure, the expression of isol in transformed barley endosperm was 
mainly confined to the periphery of the endosperm, towards the 
aleurone layer. The significance of this finding, if any, is 
unclear. The endosperms used for transformation were from seeds 20 
d.a.p., which is near maturity. Since programmed cell death of the 
starchy endosperm tissue in barley progresses from the center, the 
uneven distribution of isol activity might indicate that, at 20 
d.a.p., only endosperm cells along the aleuron layer are 
metabolically viable. Alternatively, the distribution reflects a 
sugar gradient within the endosperm. The transfer cells, which 
facilitate transport of photosynthate to the endosperm, are located 
at the basal endosperm, suggesting that, on sucrose medium, the 
sugar concentration in the endosperm will be highest at the 
vicinity of the transfer cells (for a recent review on endosperm 
development, see Olsen, 2001) . 

The average effect of three independent experiments is shown in 
Figure 10C. 

Example 6 - Isolation of cDNA for SUSABA2 Orthologs in Rice and 
Wheat 



In the light of the disclosure above characterising the involvement 



of the SUSIBA2 transcription factor in regulation of isol 
expression in barley endosperm/ it was plausible that SUSIBA2 
orthologs should be found in sink tissues also in other plants. The 
primers used for cloning of the suslba2 cDNA were employed for RT- 
PCR amplification of total RNA isolated from rice and wheat 
endosperm. For both species , only one major RT-PCR product was 
obtained (data not shown) . The PCR products were subcloned and 
sequenced. 

The deduced amino acid sequences from the PCR products include the 
region between the two WRKY domains, and analysis of these segments 
from the rice and wheat sequences show that they share a high 
degree of identity between themselves and with SUSIBA2 (Figure 
11A) . 

It is notable, apart from the conserved WRKY domains, the overall 
degree of identity between SUSIBA2 and the SP8a-binding-SPFl is 
rather low (28%; Figure 11) . 

The entire sequence for the rice ortholog could be retrieved from 
the TIGR Rice Genome Project (http://www.tigr.org/tdb/e2kl/osal/) 
on an unclassified entry with the temporary name 2017.t00012. Since 
the open reading frame for the rice clone could be obtained in its 
entirety, we were able to compare the complete primary sequence 
between SUSIBA2 and the rice ortholog (Figure 11B) . The two 
proteins are very similar, also in the extreme C- and N-termini 
(they share a high degree of amino acid identity (80%) along their 
entire lengths) . 

Notably, the "SUSIBA2-specif ic" C-terminal insertion (see Example 
3) is present in the rice sequence. 

Further investigations have identified SUSIBA2 orthologs also in 
potato and Arabidopsis (data not shown) . 

Example 7 - Discussion of relevance of SURE-element binding 

The importance of the SURE cis element for sucrose induction of 
gene activity in plants was first demonstrated by Grierson et al . 
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(1994) from work on the potato patatin promoter, and has been 
corroborated by a large number of studies in different plants (Fu 
et al., 1995; Kim and Guiltinan, 1999; Kim et al., 1991; Li et al, 
2002; Mita et al., 1995; Rhee and Staswick, 1992). In some of these 
studies the assignment of the SURE sequence as a regulatory element 
was inferred from experimental data, i.e. binding assays, whereas 
in other cases it was based primarily on computational analyses. 
Here we show that two (and possibly three) SURE elements are 
located also in the barley isol promoter and that they provide 
binding sites for the SUSIBA2 transcription factor. Its presence in 
the barley isol and maize sbel promoter suggests that the SURE 
element should also be found in other starch synthesis genes 
subject to sugar induction. A database search of the proximal and 
distal promoter regions of sequences revealed that, indeed, SURE- 
like sequences are present in promoters of several genes encoding 
enzymes involved in starch synthesis. Among those are barley 
sbellb, and the barley genes for starch SSI (ssl) and the small 
subunit of AGPase {agpaseS) . 

Although all three SURE sequences in the barley isol promoter, as 
well as those identified in other genes, bear a good resemblance to 
the patatin SURE sequence in pair-wise comparisons, it is difficult 
to arrive at a consensus sequence for the SURE element. An 
alignment of a subset of reported SURE sequences is shown in Figure 
14. 

The compilation also includes the SURE sequences from barley ssl 
and agpaseS. As can be seen, the SURE element is best described as 
an AT-rich box with the consensus core A/TAAANA, where N denotes 
any nucleotide. It is similar to the so-called A box response 
element in the wheat gbssl promoter (Kluth et al . , 2002). A/T-rich 
protein-binding sequences are common in both eukaryotic and 
prokaryotic genes, and certain protein motifs, such as SPKK, show 
preference for binding at A/T-rich sites (Churchill and Suzuki, 
1989) . A/T-rich regions might also be expected in some promoters 
since they facilitate DNA unwinding. The role of the A/T-rich box 
in the SURE element remains to be assessed. Certainly, DNA-protein 
interactions will be affected by the sequence context surrounding 
the A/T-rich region. This is illustrated by the presence of several 



A/TAAANA sequences in the DNase I hypersensitive region of the isol 
promoter upstream of -608 (Figure 5) . 

We noted that, in contrast to the barley sbellb promoter, which 
contains a SURE element (Figure 14) , no SURE-like sequence was 
found in the promoter of the barley sbella gene. Both sbellb and 
sbella contain a W box (data not shown) . We have reported 
previously (Sun et al . , 1998) that expression of barley sbellb is 
endosperm specific while that of sbella is not. We suggest that the 
W box in the isol, sbellb and sbella promoters serves as a general 
activating element while the sugar responsiveness conferred to Isol 
and sbeJJjb by the SURE elements contributes to their endosperm 
specificity (by the higher sugar levels in sink tissues as compared 
to embryos and vegetative tissues) . Further control of endosperm 
specific expression is probably exerted via binding of repressor 
proteins in non-expressing tissues. From earlier work, we concluded 
that repressors are recruited to the Sp8a element in the barley 
isol promoter (Sun et al . , 1999), and to the Bbl element in the 
second intron of barley sbellb (Ahlandsberg et al . , 2002b). 

The importance of the SURE element for isol expression is 
demonstrated by the decoy experiment (Figure 10) . The activity 
remaining in the presence of the SURE-b decoy could indicate that 
the decoy did not efficiently trap the SURE-binding activity. 
However, more likely, it illustrates that elements other than SURE 
are sufficient to maintain a basic level of isol activity. The 
involvement of the SURE element in regulation of isol expression is 
in line with the ectopic expression of isol in sugar-treated barley 
leaves (Sun et al . , 1999). The emerging transcription factor ODN 
technology offers a powerful and convenient way to assess 
transcription factor function and involvement of cis elements both 
in vitro and in vivo. The strategy has gained rapid popularity in 
animal sciences, e.g. in studies on gene therapy (Mann and Dzau, 
2000; Morishita et al., 1998). To our knowledge, the present report 
represents the first example of the transcription factor ODN 
technology in plant biology. 

SUSIBA2 is the first transcription factor known to bind to a SURE 
element, and, in fact, it is the first transcription factor 



reported for regulation of starch synthesis. Although binding of 
nuclear protein fractions to the SURE element in the potato patatin 

(Grierson et al., 1994) and maize sJbeJ (Kim and Guiltinan, 
1999) promoters , to the Sp8a sequence in the barley isol promoter 
(Sun et al., 1999), and to the Bbl element in the second intron of 
the barley sbellb gene (Ahlandsberg et al., 2002b) has been 
documented previously, the interacting trans factors have not been 
isolated or identified. 

METHODS USED IN EXAMPLES 

Plant Material 

Barley (Hordeum vulgare L. cv. Pongo), rice and wheat were grown in 
soil in a climate chamber under a 16/8 h photoperiod as described 
by Sun et al . (1998, 1999). 

Oligonucleotides 

The following oligonucleotide primers were used: 

Primer 1, 5 ' -CCAAGAAGTTATTACAAGTG-3 ' 

Primer 2 , 5 ' -TGGTTATGTTTTCCTTCGTA-3 ' 

Primer 3 , 5 ' -GGAATTCCATATGTCCCCCGCGCGGCTGCC-3 ' 

Primer 4, 5 ' -CGGATCCGGCTGAACTGACTTGTAAC-3 ' 

Primer 5, 5 ' -CCCTCGTGGAAGCAAAACTGTGTTTCTCGC-3 ' 

Primer 6 , 5 ' -GCGAGAAACACAGTTTTGCTTCCACGAGGG-3 ' 

Prime r 7 , 5 ' -GGAAAACCGAAATACCAAAAAATAATAATAAAATAATAAT- 3 ' 

Primer 8 , 5 ' -ATTATTATTTTATTATTATTTTTTGGTATTTCGGTTTTCC-3 ' 

Primer 9 , 5 ' -TCGCTAACCAGTGACTTCCACGTTTCATCATTTATT-3 ' 

Primer 10, 5 ' -AATAAATGATGAAACGTGGAAGTCACTGGTTAGCGA-3 ' 

Primer 11 , 5 ' -AT G ACT C GAGC AG AT T T T GG AT T GC T AAT G A- 3 ' 

Primer 12 , 5 ' -ATGACCATGGGCCACCTCGTGTTGGTTCTTCGT-3 

Computational Analyses 

Searches of the NCBI databases were performed with the BLAST 
service (http://www.ncbi.nlm.nih.gov/BLAST/). Alignments of 
nucleotide and amino acid sequences were carried out using the 
Clustal W Service at the European Bioinf ormatics Institute 
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( http: //www .ebi.ac.uk/clustalw) , except for alignment of sweet 
potato, cucumber and parsley SPF1 cDNA sequences, which was done 
with the MacVector® program (Accelrys Inc., USA), and alignment of 
the SURE element sequences, which were run on the T-Coffee server 
5 (http://www.ch.embnet.org/software/TCoffee.html). For presentation 
purposes, the ALN output from the Clustal W program was fed into 
the BOXSHADE server 

(http : //www . ch . embnet . org/sof tware/BOX_f orm. html . 
10 Isolation of cDNA Clones 

Primers 1 and 2 were constructed according to consensus sequences 
for the SPF1 and SPFl-like cDNA sequences. Total RNA was isolated 
from developing barley, rice and wheat endosperm according to Sun 

15 et al. (1999). First-strand cDNAs were produced as described 
(Frohman et al . , 1988) using primer 2. RT-PCR was carried out 
according to standard protocols (Sambrook et al., 1989) using 
primers 1 and 2. The PCR -products were subcloned and sequenced. The 
RT-PCR product from barley was used as a probe to screen a barley 

20 cDNA library from developing endosperm 10 days after pollination. 
Library construction and screening, subcloning, DNA gel-blot 
analysis and sequence analysis were as described (Sun et al . , 1998, 
1999) . 

25 Overproduction of SUSIBA2 In E. coll, Micro s equenc ing , Antibody 
Production and Protein gel Blot Analysis 

Two primers were designed, primer 3, with an Ndel restriction site, 
and primer 4, with a BamHI restriction site. These primers were 

3 0 used for PCR amplification of the susiba2 cDNA from the isolated 
full-length cDNA clone. The amplified cDNA was inserted into the 
expression vector pET-15b (Novagen Inc., Germany) between the Ndel 
and BamHI sites. The constructed plasmid was transformed into E. 
coli strain BL 21 (DE3) . Overexpression was carried out according 

35 to the manual provided by the manufacture, except the IPTG 

induction was performed at 37° C for 2 hrs. The overproduced 
protein was purified by FPLC™ on a column with a Ni + chelating 
resin. SDS-PAGE was run as described previously (Sun et al . 1997). 
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Trypsin digestion and microsequencing were done in collaboration 
with Amersham Pharmacia Biotech., Sweden. Antisera were produced by 
AgriSera AB, Sweden after immunizing rabbits with the Coomassie- 
stained SUSIBA-70 and -68 polypeptide bands excised from the SDS 
gel. Protein gel blot analysis was performed as before (Sun et al . 
1997) . 

Electrophoresis Mobility Shift Assay (EMSA) 

EMSAs were carried out essentially as described by Sun et al . 
(1999) . The competitor probes were prepared by annealing different 
non-labeled oligonucleotides in 14 mM Tris-HCl (pH 8.0), 7 mM 
MgCl 2 . 

DNase I footprinting 

Restriction fragments of the isol promoter were singly end-labeled 
and used together with purified SUSIBA2 and the Sure Track 
Footpring Kit (Amersham Pharmacia Biotech, Sweden) . All procedures 
were according to the manual provided by the manufacturer. 

RNA Gel Blot Analysis 

RNA isolation and RNA gel blot analysis were performed as described 
previously (Sun et al . 1998, 1999). 

Sucrose Isolation and Analysis, and Exogenous Sucrose Induction 

Sucrose isolation, TLC and sucrose induction of ectopic gene 
expression in barley leaves were carried out as described (Sun et 
al. 1998, 1999) . 

Transformation and GFP Assay of Barley Endosperm 

The 1504-bp long promoter region of the isol gene was PCR-amplif ied 
using primers 11 and 12 and fused to the gfp plasmid pNl473GFP as 
described by Ahlandsberg et al. (1999, 2002a), yielding the 
construct p48. The ODN SURE-b decoy was generated by annealing 
primers 7 and 8 in 14 mM Tris-HCl (pH 8.0), 7 mM MgCl 2 . 
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Transformation of barley endosperm by microprojectile bombardment, 
using a DuPont PDS 1000 He Biolistic Delivery System (Bio-Rad 
Laboratories, Hercules, CA, USA), and transient assay of GFP 
fluorescence were performed as described by Ahlandsberg et al. 
(2002a) with the following exceptions. Barley caryopses of 19-22 
d.a.p were bisected longitudinally and placed endosperm side up on 
DG3B medium, containing Murashige and Skoog medium (Murashige and 
Skoog, 1962) supplemented with 100 g l" 1 sucrose, 1.0 mg I" 1 
thiamine-HCl, 0.25 g l" 1 myo-inositol, 1.0 g l" 1 casein hydrolase, 
and 0.69 g l" 1 proline solidified by 5.0 g l" 1 Gelrite (Duchefa, 
Haarlem, The Netherlands), for two hours prior bombardment. Six 
bisected caryopses were placed in the center of a 20 x 90 mm Petri 
dish with DG3B medium, 13 cm below the stopping screen, and 
bombarded once using a 7.6 MPa rupture disc. After bombardment, the 
caryopses were kept for 48 hours in darkness on plates before GFP 
fluorescence assays . 

Accession numbers 

GenBank accession numbers referred to in this work are: Arabidopsis 
gene for ft-amylase, S7707 6; Arabidopsis hypothetical protein, 
T08930; Arabidopsis WRKY transcription factor 20, Q93WV0; 
Arabidopsis putative protein, NP567752; Arabidopsis unknown 
protein, AAK76566; barley agpaseS, AJ239130; barley isol f AF142589; 
barley sbellb, AF064563; barley ssl, AF234163; barley WRKY proteins 
implicated in drought tolerance, BM816210 and BM816211; E. coll 
trehalose-6-P synthase, S33584; human IGFl receptor, NM_000875; 
maize sJbeJ, AF072724; potato patatin class I gene, M18880; potato 
gene for proteinase inhibitor II, X04118; Potato gene for sucrose 
synthase, U24087; soybean gene for vegetative storage protein, 
M76980; sweet potato SPFl, S51529; tobacco WRKY protein, BAB61056; 
tomato hypothetical proteins, CAC36397 and CAC36402; white broom 
drought induced protein, AAL32033. 
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Sequence annex 

Rice susiba2 cDNA 

CGTTCGCTTGATGGTCAGATTACTGAAGTGGTTTATAAAGGGCGTCACAATCACCCTAAGCCCCAAC 
CCAATAGGAGGCTGTCTGCCGGTGCAGTTCCTCCAATCCAGGGTGAAGAAAGATATGATGGTGTGGC 
AACTACTGATGACAAATCTTCAAATGTTCTTAGCATTCTTGGTAATGCAGTACATACAGCTGGTATG 
ATTGAGCCTGTTCCAGGCTCAGCTAGTGATGATGACAATGATGCCGGAGGAGGGAGACCTTACCCTG 

GAG AT GAT GCT GT T GAG GAT GAT GAT T T AG AGT C AAAAC GAAGGAAAAT GGAAT CT GC T G CT AT TGA 
TGCTGCTTTGATGGGCAAGCCTAACCGTGAGCCTCGTGTTGTAGTACAAACGGTTAGTGAGGTTGAC 
ATCTTGGATGATGGGTACCGCTGGCGCAAGTATGGCCAGAAAGTAGTTAAAGGAAACCCCAATCCAC 
G GAGT T AC T AC AAGT GC AC AAATAC AGGAT GC C C AGT C AGGAAGCAT GT T G AGAGAGC AT C AC AT G A 
T CCAAAATCAGTCATAACAACAT ACGAAGG 

(the rice ortholog sequence can be retrieved from the TIGR Rice 
Genome Project (http://www.tigr.org/tdb/e2kl/osal/) on an 
unclassified entry with the temporary name 2017.t00012) 

Wheat susiba2 cDNA 

ACAAGTGCACACATCCTAATTGTGAAGTAAAAAAGCTATTGGAGCGTGCGGTTGATGGTCTGATCAC 
GGAAGTTGTCTATAAGGGGCGCCATAATCATCCTAAGCCCCAGCCTAATAGGAGGTTAGCTGGTGGT 
GCAGTTCCTTCGAACCAGGGTGAAGAACGATATGATGGTGCGGCAGCTGCTGATGATAAATCTTCCA 
ATGCTCTTAGCAACCTTGCTAATCCGGTAAATTCGCCTGGCATGGTTGAGCCTGTTCCAGTTTCAGT 
TAGTGATGATGACATAGATGCTGGAGGTGGAAGACCCTACCCTGGGGATGATGCTACAGAGGAGGAG 
GATTTAGAGTTGAAACGCAGGAAAATGGAGTCTGCAGGTATTGATGCTGCTCTGATGGGTAAACCTA 
ACCGTGAGCCCCGTGTTGTCGTTCAAACTGTAAGTGAGGTTGACATCTTGGATGATGGGTATCGTTG 
GCGGAAATATGGACAGAAAGTTGTCAAAGGAAACCCCAATCCACGGAGTTACTACAAATGCACAAGC 
AC AG GAT G CC C TG T GAG G AAG CAT GT T GAG AGAGC AT C GCAT GAT C C T AAAT C AG T G AT AAC AACGT 
AC GAAGGAAAAC AT AAC C A 

Wheat susiba2 peptide 

PRSYYKCTHPNCEVKKLLERAVDGLITEWYKGRHNHPKPQPNRRLAGGAVPSNQGEERYDGAAAAD 
DKSSNALSNLANPVNSPGMVEPVPVSVSDDDIDAGGGRPYPGDDATEEEDLELKRRKMESAGIDAAL 
MGKPNREPRVWQTVSEVDILDDGYRWRKYGQKVVKGNPNPRSYYKCTSTGCPVRKHVERASHDPKS 

VITTYE 
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1 agatcatgcatcgttgagccgcttggtggg 
31 cgagacggctcgatgggotccccgagcgggcacgcgggctcgatgaagtactcc 
85 ccgtgcttcacttctgcgccacggtgatct.cc-ttgggtcgggt:ccaagccgaaa 
139 cctagctccggggacctcccctcgcccgccgggagctcgcccgagaagccctac 
193 cccgcggatogacgcgtogcggcgctcgccggcgcgggcgcgaggtacaaggcc 
247 at gtcc ccc gcgc ggc tgcc gatctcgc gc gag t c ctgcctc acc atcccc gc c 

MSPARLPXSRESCLTIPA 18 
301 ggcttcagcccctcagcgctcctcgactcccccgtgctcctcaccaacttcaag 

GFSPSALLDSPVLLTHFK 36 
355 gttgaaccttcaccaacaactggtagtctgggcatggctgcgattctgcacaag 

V E P S P T_T GS It G M A A ILHK 54 
409 agcgctcatccagacatgctgccttcgccacgggataaatctgttcgtaatgcc 

SAH PDMXjP S P R D K S V R N A 72 
463 catgaagataggggttctagggattttgaatfccaagcctcatctgaattcgtct 

HEDRGSRDFEFKPHLNSS 90 

517 tctcaatcactggctcctgctatgagtgatctaaaaaaacatgagcattctatg 

S Q S I* A P A M S DL'K RHERSM 108 

57 1 caaaatcagagtatgaatcccagctcatcatctagcaatatggtgaatgaaaac 

QNQSMNPS S S S S HMVHEH 126 
625 agacctccctgttcacgtgagtcgagtcttacagtgaatgtaagtgctcagaac 

RPPCSRES SI.TVNVSAQN 144 
679 caacctgttggaatggtfcggtttgactgacagcatgcctgctgaagttggtaca 

QPVGMVGLTDSMPAEVGT 162 
733 tctgagccgcagcagatgaatagctctgacaatgccatgcaagagccgcagtct 

S EP QQMNS SD HAKQEPQS 180 
787 gaaaatgttgctgacaagtcggcagatgatggctacaactggcggaaatacggg 

EHVADKSADDGYNWRKYG 198 
841 cagaagcatgtcaagggaagtgaaaaccct.agaagttactacaagtgcacacat. 

QKHVKGSE N P R S Y Y K C T H 216 
8 95 cc taattgtgaagta a aa aa gctattg g age gt gc agttgat ggtct g atcacg 

P N. C E V K K It Ii E RAVDGLIT 234 
949 gaagttgtctataagggacgccacaatcatcctaagccccagcccaataggagg 

EVVYKGRHNHPKPQPHRR 252 
1003 ttagctggtggtgcagttcc ttcaaaccagggtgaagaacgatatga cggcgct 

|L A G G A V Pj SNQGEERYDGA 270 
1057 tx:agctqctgat ciataaatcttccaatgctcttagcaaccttgctaatccggta 

SAADDKS S HALSNI*ANPV 288 
1111 cattcgcctggtatggttgagcctgttccagct.tcagtt:agtgatgatgacatc 

HSPGMVEPVPASVSDDDI 306 
1165 gatgctggaggtggaagaccctaccctggggatgatgctacfcgaggaggaggat 

DAGGGRPY PGDDATEEED 324 
1219 ttagagtcgaaacgcaggaaaatggagtctgctggtattgatgctgcfcctgatg 

I, E S KRRKHESAGIDAALM 342 
1273 ggtaaacctaaccgtgagccccgtgtcgtcgttcaaactgtaagtgaggt.tgac 

,GKP HREPRVVVQTVSEVD 360 
1327 atcttggatgatggctatcgtfcggcggaaatatggacagaaagttgtcaaagga 

ILDDGYRWRKYGQKVVKG 378 
1381 aaccccaatccacggagttactacaaatgcacaagcacaggatgccctgtgagg 

NPNPRSYYKCTSTGCPVR 396 
1435 aagcatgttgagagagcatcacacgatcctaaat.cagtgataacaacgtatgaa 

KHVERASHDPKSVITTYE 414 
14 B 9 ggaaaacataaccatgaagtccctgctgcgaggaatgcaacccatgagatgtcc 

GKHHHEVP AARHATHEMS 432 
1543 gcgcctcccatgaagaacgtcgtgcatcagattaacagcaatatgcccagcagc 

AP PMKHVVHQIHSNKPSS 450 
1597 attggtggcatgatgagggcatgtgaagccaggaactacaccaaccaatattct 

I GGKMRAC EARNYTNQYS 468 
1651 caggcggctgaaaccgacactgtcagtcttgatcttggtgttggaatcagccca 

QAAETD TV SIjDIjGVGISP 486 
1705 aaccacagcgacgcgacaaaccaaatgcagtcttcaggtcctgaccagatgcag 

MHSDATNQMQSSGPDQMQ 504 
1759 tatcaaatgcaaaccatgggttcgatgtacggcaacatgagacatccatcatca 

YQMQTMGSMXONMRHPSS 522 
1813 atggcagcgccagcggtacaaggaaactctgctgcccgcatgtatggttcgaga 

MAAPAVQGNSAARMYGSR 540 
1867 gaagagaaaggtaacgaagggtttactttcagagocacaccgatggaccattca 

EEKGMEGFTFRATPMDHS 558 

1921 gctaacctatgctatagcagtgctgggaactfcggtcatgggtccatgagaggga 

AHLCYS SAGHLVMGP* 573 

1975 atgagagtgtctgcaaatgctcatagctccatgaatcatatattaccaacaatg 

2029 ctttgtaatgacaatctcttcagcaagattctcaattgtctcaattgtgtatcg 

2083 gttacaagtcagttcagccggaggcaagtatgctagtataagctatacgtgggg 

2137 gcactgcagcaaatacgcatgtgtctfctttaagtgcggaaaaggcccttgctgt 

2191 atgtagcatcgcagccctacattcgttgtacagcgaacctaatatgattaatta 

2245 attagattatgagaatttggtttcgtgaactgtctaatcttctgtactggaata 

2299 ttgatagaaatatagattatggtaattttctttaaaaaaaaaaaaaaaaaaaaa 

2353 aa 
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E, coli crude FPLC purification 
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Protected region -603 CCGAAAAAAS^^ 

PataU n SURE sequence AATACTAATAAAGA ATAGAAAAA 



Protected region -568 AR|AC*»ltflAA^^ -545 
Patatin SURE sequence 



Fig. 5 



6/14 




Fig. 6 
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Fig. 8 
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Fig. 10 
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Fig. 11 
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